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Abstract 

We investigate a quantum state of positive charge in DNA. A quantum state of electron hole is 
determined by the competition of the pi-stacking interaction b sharing a charge between different 
base pairs and the interaction A with the local environment which attempts to trap charge. To 
determine which interaction dominates we investigate charge quantum states in various (GC) n 
sequences choosing DNA parameters satisfying experimental data for the balance of charge trans- 
fer rates G + <-> G+, n = 2,3. We show that experimental data can be consistent with theory 
only assuming i < A meaning that charge is typically localized within the single G site. Conse- 
quently as follows from our modeling consideration any DNA duplex including the one consisting 
of identical base pairs cannot be considered as a molecular conductor. Our theory can be verified 
experimentally measuring balance of charge transfer reactions G + <-> G^, n > 4 and comparing 
the experimental results with our predictions. 

PACS numbers: 7080.Le, 72.20.Ee, 72.25.-b, 87.14.Gg 
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I. INTRODUCTION 



Positive charge (hole) transfer in DNA is extensively investigated since its experimental 
discovery This process can be responsible for the oxidative DNA damage 



and is possibly important for DNA repairing [6 



. Also an ability of DNA to promote long 



distant charge transfer can be used in molecular electronics applications [8|]. These studies 
raise the question, whether DNA is a molecular conductor. In molecular conductors (e. 
g. carbon nanotubes) charge is usually delocalized within several sites (monomers). Such 
system possesses a high conductivity similarly to metals. The alternative behavior takes 
place when the charge is typically localized within the single site and it hops to adjacent 
sites due to rare environment fluctuations. Based on the results of direct measurements of 
charge transfer in DNA 9] we argue in this paper that DNA is not a molecular conductor. 

DNA contains two different sorts of base pairs AT and GC forming quasi-random se- 
quences. The lowest ionization potential is attributed to a GC pair (essentially G-base 
10||). Since the electron transfer integral b between adjacent bases does not exceed AT 
- GC ionization potential difference A ~ 0.5eV, the quantum state of charge in a static 
environment will be localized near some G base and its localization length is comparable to 
the interbase distance 
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12j. Interaction with environment breaks down this localization, 
inducing charge hopping between quantum states localized at adjacent G sites. This hap- 
pens because of rare environment fluctuations supporting derealization of charge between 
neighbo ring G bases. Indeed, according to experimental studies and theoretical 

models [UQ the sequence dependent charge transfer in DNA can be represented as the 
series of charge hops between adjacent G bases serving as centers of localized states. An 
addition of AT pair between GC base pairs dramatically reduces the charge hopping rate 
IdjQ, Q] and therefore the optimum base sequence for the efficient charge transfer consists 
of identical base pairs. We are going to study a charge (hole) quantum states in sequences 
of identical GC pairs of various lengths, as most promising candidates for efficient charge 
transfer. 

The thermal energy at room temperature ksT ~ 0.026eV is very small compared to at 
least one of the characteristic energies of the system including the reorganization energy A 
and the electron transfer integral b. The reorganization energy is caused by the electrostatic 
interaction of the hole with the environment and its minimum estimate is A ~ 0.25eV 
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15l . Il6l . 1 171 Il8l . [19|. The electron transfer integral b is associated with the pi-stacking 



interaction of heterocyclic groups in DNA bases. It is defined as the gain in the hole energy 
due to its sharing between two adjacent DNA bases having equal energies with respect to 
the energy of the hole localized within a single G base }18| . The minimum estimate for 
the electron transfer integral between adjacent G bases is around O.leV |l8j]. Thus the 
thermal energy is, indeed, small compared to other characteristic energies and we can use 
the ground state of the hole coupled to environment as the representative state. One should 
notice that the ground state can be used as a representative state, but it strongly differs 
from transition states responsible for the charge transfer. Since the hole transitions are quite 
rare the "typical" charge state can be described ignoring them. 

The spatial size of the hole ground state is determined by the competition of the de- 
localization of charge due to the pi-stacking interaction of heterocyclic groups belonging 
to adjacent bases and the localization caused by the environment polarization around the 
charge. The charge derealization energy is characterized by the effective electron transfer 
integral b and the localization energy is given by the environment reorganization energy A 



15l . Il6l . Il7l . Il8l . Il9| . Derealization of charge over k base pairs leads to the gain in the kinet- 
ics energy E^i ~ —26 + b/k 2 , similarly to the classical problem of particle in the box. The 
ganization energy scales with the size of charge wavefunction as E\ oc ~ —X/k (see e. g. 



reor 



m 



20] ). This dependence can be understood as following. The local environment polariza- 
tion induced by the hole is proportional to the charge density P oc d oc 1/k, where d stands 
for the local charge density. Consequently the interaction in each site having charge e/k 
with the environment polarization is proportional to their product pd oc 1/k 2 and the sum 
of all these k interactions over all k sites results in the dependence \/k. This dependence 
takes place only in a one-dimensional system. Since at large k the reorganization energy 
—X/k dominates over the kinetic energy b/k 2 , the ground states is always localized and the 
localization radius can be estimated using the number of sites k minimizing the total energy 

E tat (k)*-2b + ±-± (1) 

The minimum takes place at = fc* ~ 2b/ X which is the number of DNA bases occupied 
by the hole in its ground state. At zero temperature the hole is localized, while at finite 
temperature it can hop between different states because of its interaction with the fluctuating 
environment. In the translationally invariant system ((GC) n or (AT) n ) the potential barrier 



separating two configurations can be estimated as the energy price for the environment 
fluctuation increasing the size of the wavefunction by one more site compared to its optimum 
state k* — > k* + 1, which can be expressed as E tot (k* + 1) — E tot (k*) Eq. flTJ. Such fluctuation 
creates the transition state and after the relaxation of environment the hole can arrive at 
the new equilibrium centered at different G base. If b ^ X then this activation energy is 
given by approximately a quarter of the reorganization energy A/4 ^> ksT '. Then we expect 
the hole mobility to obey the Arrhenius law as in insulators. In the opposite limit we obtain 
a very small value A ~ A 4 /(166 3 ), which becomes negligible at moderately large b/X. For 
instance if b = leV the potential barrier for the hole transition becomes smaller than the 
thermal energy fcgT m 0.026eV already at A < 0.7eV. In this regime charge transport is 
weakly sensitive to the temperature as in metals. 

Therefore it is very important to determine the true relationship of b and A in DNA. 
This relationship characterizes its conducting behavior. Indeed, in the regime b > X a DNA 
molecule made of identical base pairs would behave as a one- dimensional conductor, while 



in the opposite 

nr 

literature 10. 
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imit DNA cannot be treated as a molecular wire. Existing estimates in 
16 



13, llS lia, l20j do not help much in determining the relationship of b 



and A, because there is a large controversy between different studies^ In particular, various 
estimates for the electron transfer integral b range from 0.05eV 



to 0.5eV 



lOf. Also 



all calculations of b ignore vibrational rearrangements. Polar vibrational modes associated 
with the covalent bonds possess a high vibrational quantum energy hu ~ 0.3eV ^> ksT. At 
room temperature these modes must remain in the ground state during charge tunneling. 
However since the tunneling of hole changes the equilibrium coordinates of polar vibrations 
they must tunnel together with the hole. This leads to the redefinition of the effective 



tunneling amplitude b (if it is sma 



the given vibrational mode 
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ler then the reorganization energy X v ) associated with 



24) as 



J eff 



7 Au 

b ■ e 2fi " 



(2) 



To our knowledge this effect is ignored in all existing theoretical estimates of the electron 
transfer integral. 

The calculations for the reorganization energy A associated with the classical interaction 
with solvent were made using the continuous medium approach. The estimates of A range 
from 0.25eV [ljj] to more than leV [ijj] due to uncertainty in the water dielectric constant 



value near the DNA molecule. It is hard therefore to select the right parameters based on 
existing theoretical results. 

We suggest an alternative method to study the hole quantum state within the DNA 
molecule, using experimental data sensitive to the relationship of two key parameters of the 
theory b and A. Namely, we exploit the rate constants for the balance of charge transfer 
reactions between different {GC) n complexes, measured by Lewis and coworkers [9( 

k GG 

G+ + GG h G + (GG) + , £ = 7.7±1 

k GG -« 

Iggg (3) 

t j GGG 

G+ + GGG +± G + (GGG)+, ^ uu = 20±l. 

k GGG -i 

In the thermal equilibrium these ratios are determined by base pair partition functions 

k GG Z 2 +Zi 



'2 



k GG z 2 z x . 



UGGG 7 7 

3 k GGG Z 3 Z l+ ' { ] 

where Z n+ stands for the partition function of G n sequence containing the single hole in it, 
while Z n is the partition function of the same base sequence, but without the hole. Since G 
and G n complexes are separated by some AT bridge we can ignore their interactions. 

We assumed that the charge transfer between G and G n complexes is very slow compared 
to the charge transfer rate inside the complex. This agrees with experiments showing the 
strong reduction of the charge transfer rate with the length of AT separating GC pairs 
{4]. Consequently the contribution of highly excited states like the transition states for the 
charge transfer between complexes to the partition functions can be neglected so we can 
assume that the local equilibrium of charge within G n sequence is established. Accordingly 
one can use Eq. (j3J) for the ratio of charge transfer rates. 

Both ratios in Eq. (j3J) are determined by parameters b and A and the thermal energy 
at room temperature ksT ~ 0.026eV. Below we calculate both ratios using tight binding 



model for G n comp 



the environment 
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exes and standard linear response theory for charge interaction with 



2jJ . Theory determines the domain of parameters A and b satisfying 



experimental data Eq. (jHJ). We demonstrate that any reasonable choice of X and b, satisfying 
Eq. corresponds to the regime b A, where the hole in its ground state is localized 

essentially in a single G base (see Fig. Qp. 
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The manuscript is organized as following. In Sec. [IT] we formulate and discuss the model 
for the hole in G n sequence coupled to the classical environment. In Sec. II III we derive 
expressions for rate ratios r n . In Sec. IIV Al we discuss the properties of the hole ground 
state in the domain of its parameters consistent with the experiment [9| and show that it is 
localized essentially within the single G site. In Sec. HVBI we discuss the partition functions 
of charge using the simple perturbation theory approach and predict the balance of charge 
transfer reactions between G + state and state for arbitrarily number n of G bases. In 
Sec. IIV CI we discuss the effect of correlations of environment polarizations on different G 
sites on ratios r n . In Sec. IIV Dl we investigate the influence of A bases surrounding G n 



sequences in the experiment [9] on our conclusions. In Sec. |V] we resume our results and 
discuss the experiments, which can verify our theory and help to determine parameters b 
and A more accurately. 

II. MODEL 

The chain of n GC base pairs can be described by the tight binding Hamiltonian coupled 
to the classical environment represented by coordinates Xi, i = 1, ...n for each DNA G site 

H = Hhole + H int , 
n-1 

Hhoie = -b^2(cfc i+1 + cf +1 Ci), 

i=l 



- n n 

Him = 7^^2 X i XiUi > 
i=l 

rii = cfci. (5) 



2A 

i=l i=l 



Here Cj, cf are operators of creation and annihilation of electron hole in a site i. Clas- 
sical coordinates Xi describing the polar environment, including solvent and counterions, 
are directly coupled to the local charge density rii = c^Ci and the parameter A stands for 
the reorganization energy associated with the interaction of charge and its environment. 
The environment energy is expressed as a bilinear form with respect to solvent coordinates, 
which is justified by a standard assumption that polarization fields are small compared to 



atomic fields 21[ so we ignore X 3 terms. Since the potentials X include the contribution of 
counterions it is not clear whether the ion effect can be treated in the linear response approx- 
imation. However, we do not think that the contribution of ions can critically change our 
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assumptions because as was demonstrated by Voityuk and coworkers [25| the contribution 
of ions does not exceed 20% of the overall charge energy fluctuations, while the remaining 
80% of fluctuation are associated with water. As discussed below in Sec. HVBI the validity of 
the linear response approximation does not crucially affect our consideration in the strong 
localization regime. 

We assume that only classical degrees of freedom with excitation energy comparable or 
less than the thermal energy are left in Eq. (jSJ), while high energy modes are integrated 
out. This may lead to the renormalization of parameters in the system Hamiltonian Eq. 



© (see e. g. Eq. © and Refs. [22 



23 



241 ]) and we assume that this renormalization 



is made. We do not include off-diagonal terms XiXj, i ^ j into the Hamiltonian. This 
is somehow justified because they are smaller than the diagonal ones }17| . We will show 
below that for G2 sequence the problem including off-diagonal terms can be reduced to the 
diagonal model Eq. (jSJ) with the replacement of the single site reorganization energy A with 
the reorganization energy for charge transfer between adjacent sites. For GGG sequence the 
similar replacement with removal off-diagonal terms remains a good approximation (see Sec. 
HVCp . Therefore all our results are valid if we consider the more general interaction model 
with the replacement of the single site reorganization energy A with the charge transfer 
reorganization energy A* as described below in Sec. IIHI 

We assume the electron transfer integral h to be independent of the environment fluctua- 
tions. Fluctuations of the electron transfer integral were treated as less significant compared 
to fluctuations in local site energies because the change of the site energy by more than 
the thermal energy strongly modifies the tunneling rate, while the change in the electron 
transfer integral requires the energy fluctuation comparable to the energy 5E ~ H/r where 
t is the tunneling time for the electron transition [26] . This energy 5E can be comparable to 
the barrier height 27j which is much larger than the thermal energy. The above expectation 
conflicts with the molecular dynamics simulations of the electron transfer integral between 
hole states in accent G hases affected h y the interaction with envi— (see Ref. 
and references therein), where the remarkable effect of fluctuations on the electron transfer 
integral has been found. However, molecular dynamics results should be considered with 
care because of the classical treatment of the nuclei motion, which can be partially of the 
quantum mechanical nature. We estimated the effect of the distribution of the electron 
transfer integral b on the ratios of charge transfer rates for the system described by Eq. 
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(jSJ). The distribution of b was approximated by the Gaussian distribution with parameters 
< b >~ 0.046eV and 5b ~ 0.064eV [28] while reorganization energy was taken as A = 0.24eV 



251 ] . The calculations result in the overestimated ratio of charge transfer rates r 2 Eq. 



by the factor of five. This result queries the accuracy of molecular dynamics approximation 
to fluctuations. Also the measurements of charge transfer efficiency between G and GGG 
sequences separated by n AT pairs [29j can be successfully interpreted assuming that the 



30]. 



charge transfer integral is constant at least for n < 5 AT pairs 

In the experiment G n sequence was surrounded by A bases. The addition of A bases 
surrounding G n sequences to our model Eq. (jSJ) can change our results. The changes are 
associated with the possible sensitivity of the hole energy in the G site to its neighbors 
and the ability of the hole to come virtually to the adjacent A site. According to quantum 
chemistry calculations the ionization potential of the G base depends on surrounding bases 



31] . However, the difference of ionization potentials for GG + G, AG + G and GG + A does 



not exceed O.leV. It is remarkably smaller than the reorganization energy and therefore we 
will ignore it. The error of semiempirical methods used to calculate the ionization potential 
is comparable to the maximum calculated effect ~ O.leV so we cannot consider the energy 
change O.leV or smaller. Screening of electrostatic interaction by water can also reduce the 
effect of neighbors on the G + state ionization potential. 

The changes associated with the extension of the charge wavefunction to adjacent A sites 
can be more important. However these changes do not affect our main conclusion about the 
strong localization of the charge ground state as shown in Sec. IIVDI We ignore the second 
strand (C n ) because of the weak coupling between strands [181] and the large difference of G 
and C ionization potentials exceeding leV jiol ]. 



III. EVALUATION OF RATE RATIOS 



We study the ratios of charge transfer rates Eq. (T4]). Each partition function is given by 
Z n = J dXi...dX n Tre~P Hn , where H n is the Hamiltonian Eq. ([5]) describing G n sequence, 
trace is taken only over states with the single hole (Z n+ ) or no holes (Z n ) and (3 = l/{k B T). 
If there is no hole calculations are reduced to multiple evaluation of a Gaussian integral 
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leading to the expression 



Z n = c\ c= J — . (6) 

For the sequences containing a hole an analytical expression can be obtained only for n = 1 

Z x+ = ce^' 2 . (7) 

For any number n of GC pairs calculations can be simplified integrating the partition func- 
tion over a "center of mass" coordinate Xi+..X n , which is coupled to the conserving operator 
of the total number of particles c\c\ + .. + c+c n = 1. In the case n = 2 one can conveniently 
introduce the new coordinates as X = Xi + X 2 and u = X\ — X 2 . Then the Hamiltonian of 
two base pairs (Eq. ([5]) for n = 2) takes the form 

— X 2 X u 2 uirii — n 2 ) , , , , . 

Eigenenergies of this Hamiltonian can be written as 



X 2 X u 2 v? 70 



Accordingly the partition function takes the form 



Z 2+ = -J dXe-*r+T J due~— 2cosh f (3\j— + b 2 J , (10) 

The factor 1/2 is concerned with the coordinate transformation. Performing integration over 
the center of mass coordinate X we get 



Z 2+ = {V2c)eW A J °° due~^ cosh (f3^ + b 2 ) . (11) 



The ratio r 2 of reaction rates Eq. (j3J) takes the form 



r 2 = y ^e"^/ 4 due~^ cosh h^ + b 2 ]. (12) 

Consider GG sequence with off-diagonal interaction of environment coordinates 

Henv = — — — 7; — + AiXiX 2 . (13) 
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The coefficients A can be expressed in terms of the average coordinate fluctuation matrix B 

as 



A A 1 \ = I B B x ' 
At A ) y B x B 

where the elements of the matrix B are defined as [lj 



(14) 



k B TB =< X\ >=< X\ >, k B TB x =< X X X 2 > . (15) 



< ... > means the standard thermodynamics averaging with the Hamiltonian Eq. (fl3l) . 

Then we can still proceed to the new coordinates X and u defined as previously. The 
Hamiltonian can be written as 

~ (A) + A X )X 2 X (A -A x )u 2 uju, - n 2 ) 

H P i = 1 y + 4 2 b{clc 2 + cjci). (16) 

Note that the Hamiltonian for the single site has different definition of the diagonal term of 
the environment energy (n = 0, 1 is the population of the single G site) 

Hs = ^~nX. (17) 

This is because the Hamiltonian must provide the right expressions for average squared 
polarizations in Eq. ( floT) . The partition functions involved in our consideration for r 2 can 
be expressed as 



Z 2+ = ^( g ° + *0 efl*+*O/< / °° due-^2 cosh (d^ + W 

A* = B - B x . (18) 



The expression for the ratio r 2 Eq. (J4]) takes the form 



r-2 



P e -f3{B - Bl ))/4 J +C ° due -^BT) cos h (p^+fA . (19) 



7i(B - B x ) 

This expression has the form identical to Eq. fTTZj) if the reorganization energy A is replaced 
with A* = B — B x . The new parameter A* is the reorganization energy for the charge 
transfer between adjacent sites. This is the relevant parameter which can be used also for 
GGG sequence and more complicated sequences as it will be shown in Sec. IIV CI 
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The expression for r 3 is more complicated. Below we give the result after integration 
over the coordinate of the center of mass X = X\ + X2 + X3 and use two other coordinates 
v — Xi — 2X 2 + X 3 and u = Xi — X 3 in the case of the "diagonal" environment energy Eq. 



r 3 



(3 _£± 
e 3 



4 v / 3ttA 



dv 

—00 j —00 



/3v 2 (iv, 1 , 

due 12X ■ Tr [exp(—(3V 3j 



( — —j — — -b 

-b -- 

u 3 



-b — E _|_ H 
" 2^6 



Similarly one can obtain the expression for r 4 



4 V 2ttA 



e s 



+00 



OO >/ —OO «^ — OO 



/ - + - 

4^2 



-6 

" 4^2 



(20) 



due ^ « • Tr ^exp(— /3X4)J ; 

\ 
-6 
-6 -- - - -b 

V "42 

\ -6 

t = X x - X 2 - X 3 + X 4 , u = X 1 - X 4 , u = X 2 - X 3 . (21) 



Eqs. (fT2]) . (1201) . (pTj) were used for the numerical evaluation of partition functions and ratios 
r 2 , r 3 and r 4 . 



IV. DEFINITION OF PARAMETERS CONSISTENT WITH EXPERIMENT 



A. Numerical results 



We have performed numerical evaluations of ratios in Eq. (J4]) to find domains of pa- 
rameters b and A satisfying Eq. ([3]) and show these domains in Fig. [TJ The upper (lower) 
border of each domain is defined by the maximum (minimum) value of ratios r 2 and r 3 Eq. 
(JH) within the experimental error (8.7 (6.7) for GG and 21 (19) for GGG). The acceptable 
domain of parameters for a GGG sequence fully belongs to the corresponding domain for a 
GG sequence. Thus the domains for GG and GGG base sequences are completely consistent 
with each other. Therefore we cannot determine parameters A and b better then using the 
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"dark" domain for GGG. This information is still sufficient to consider the localization of 
the hole wavefunction in G n aggregates. 

Since the thermal energy k^T « 0.026eV is smaller than at least one of two other charac- 
teristic energies of the system (remember that the minimum estimate for the reorganization 
energy is A ~ 0.25eV 19] ) we can characterize the wavefunction using the system ground 
state at coordinates X\ (i = 1, ...n) minimizing the ground state energy. In the relevant 
domain of parameters in Fig. CD (A > 0.25eV) the ground state wavefunction is centered 
at one of G bases (left or right ones for a GG sequence and the central one for a GGG 



Derealization Localization 




1.0 1.5 
A,(eV) 



FIG. 1: The domains consistent with the experimental ratios of reaction rates Eq. ([3]), dark grey 
for GGG and light grey for GG. Inset shows the fraction of the quantum charge state belonging 
to the central site (Po) vs - the reorganization energy A. Charge transfer integral b is determined 
in the way to satisfy Eq. ([2D. We assumed that the system is nearly in its ground state. 
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sequence). We describe this state by the probability Po for the particle in the ground state 
to be in this central site. This probability can be calculated using the relationship 

= y, (22) 

where % is the central site. This equation can be derived as following. Coordinates Xj 
(j = 1, ...n) in the energy minimum satisfy the condition 



dig 



H 



dX j 



9, 

0, (23) 



where \g > is the wavefunction of the hole ground state at given coordinates Xj. Using Eq. 
((5]) one can rewrite Eq. (123]) as 

= < n 3 > . (24) 

The average population of the central site % (< rii >) is equal to the probability P to find 
the hole there. Accordingly we end up with Eq. fT2"2l . 

Consider the ground state wavefunction for the hole within the GG sequence. The ex- 
pression for the hole ground state energy of the GG sequence at arbitrary coordinates X\, 
X 2 reads E 2 = - - \J {Xl ~* 2)2 + b 2 . In the regime of interest 2b < A (see Fig. 

[[]) the minimum of energy is given by 

^2mm = ~~2 ~ ~\ , 



and it is realized at X% = A/2 ± a/ (A/2) 2 — b 2 = A — X 2 . Accordingly 

Note that if 2A < b the ground state wavefunction is symmetric in the minimum (Xi = 
X 2 = A/2) and the energy of this state is given by 

E2symm = ~ ~ ~ b. (27) 

Consequently Po — 1/2 (see Fig. [T]). In the case of 2b < A this symmetric state is the 

transition state for the charge transfer G + G < > GG + (saddle point in the energy function 

E 2 (Xi, X 2 ) between the energy minima centered in the first and the second G bases). For 
GGG complex the probability Po that the hole resides in the central site has been evaluated 
numerically. 
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Both probabilities Pq obtained for the A - b line corresponding to the ratio r 2 = 7.7 
Eq. (j4]) are shown in the inset in Fig. [TJ It is clear from this graph that for both GG 
and GGG sequences the wavefunction of the hole is essentially localized within the single 
G site. For instance at the minimum value of A ~ 0.25eU we have 85% and 78% of the 
probability to find the particle in that site for GG and GGG sequences, respectively. These 
probabilities increase for A = leV to 96% and 94%, respectively. Thus we come to the 
important conclusion that the wavefunction of hole is essentially localized in the single G 
site for G n sequences. This conclusion differs from the predictions of previous work [7l.fi9.l20l 
where the the quantum state of the hole was represented as a polaron of an intermediate 
range shared between several base pairs. 

In addition to the analysis of the and G% complexes we have performed the calcula- 
tions for G\ complexes for experimental verification of our theory. The partition function 
for G^ complex can be evaluated integrating first the center of mass position similarly to 
our previous calculations. The predictions for the ratio r 4 are shown in Fig. [2] by the solid 
line denoted as n = 4. The parameters A and b for calculations were obtained using the 
experimental condition r^X, b) = 20 Eq. (J3j). Using these parameters we calculated the 
ratio r 2 (solid line marked by n = 2) and ratio r 4 (solid line marked by n = 4) Eq. (1211) . 

According to Fig. [2] the ratio r 4 does not change much in the whole domain of parameters 
A and b satisfying the balance equations Eq. ([3]). We can estimate it to be around 33 and 
this is our prediction for the further experimental verification. 



B. Understanding of the balance of charge transfer reactions 

An impressive consistency between GG and GGG base sequences in Fig. [1] is not acci- 
dental coincidence and can be explained by the strong localization of charge wavef unctions. 
In the regime of strong localization the partition function Z n+ for n > 2 can be represented 
as the sum of n contributions of energy minima corresponding to wavefunctions centered in 
all n G sites with coordinates X realizing the corresponding energy minimum Xi ~ A ^> 
k ytz % for the state centered at site i. Since in the zeroth order approximation in bj A each 
quantum state is localized at one site we can neglect the difference in preexponential factors 
for the case of b = and approximate the partial i th contribution to the partition function 
as Z l n = c n e _/3£; '' ' (see Eq. (JEJ)), where Ef 1 ^ = —A/2 is the energy of the ground state for 
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coordinates X realizing the local minimum in the absence of electronic coupling (b = 0). 
Second order correction in b to the energy = — A/2, which is the first non- vanishing 
correction, is important because it appears in the exponent e~ f3E and it is multiplied by 
the large factor f3. For two states at the edges this correction adds to the hole energy as 
Ei = En ~ —A/2 — 6 2 /A, which coincides with the ground state energy for GG Eq. (1251) . 
For n — 2 remaining states the correction to the energy should be doubled because of the 
addition of contributions of two neighbors so we got E\ x) = -A/2 - 26 2 /A, 1 < i < n. The 



40-, 



30-- 



n=4 



20 



n=3 
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— i 1 1 1 1 1 1 1 1 1 1 

0.0 0.2 0.4 0.6 0.8 1.0 



X(eV) 



FIG. 2: Predictions for the ratio using parameters b and A related by the experimental condition 
r%{b, A) = 20 Eq. ([3|). Solid line denoted by n = 4 shows the predicted value of while two dashed 
lines show upper and lower boundaries for defined in accordance with Eq. (|29p . Results for n = 2 



calculated under the same assumptions are shown for comparison. Upper and lower boundaries for 
T2 are given in accordance with the experiment i|, Eq. ([3]). 
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change in equilibrium environment coordinates can be neglected because near the energy 
minimum it leads to the effect of order of (6/A) 4 . Consequently, we can approximate the 
ratio r n (cf. Eq. 01])) as 

Tn = ^1 « 2e 8b2 / x + (n- 2)e 2 ^/\ (28) 

These predictions are compared with the numerical calculations in Fig. [21 One can con- 
clude that perturbation theory works reasonably well in the parameter domain of interest 
corresponding to A > 0.25eV for n = 2 and n = 3. 

In particularly, it follows from Eq. (1251) that (Z 3+ Z 1 /(Z 3 Zi + ) « (Z 2+ Z 1 /(Z 2 Zi + ) + 
((Z2+Zi/(2Z 2 Zi + )) 2 ). Indeed, this relationship is satisfied for the experimental values of 




FIG. 3: Comparison of perturbation theory approaches with numerical simulations for ratios n 
and r 2 - Curves show predictions of Eq. (|28p for parameters A and b chosen to satisfy Eqs. (|12[) . 
(pOD . respectively (r 2 (A,6) = 7.7, r 3 (A,6) = 20). 
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ratios within the accuracy of the experiment. This explains the consistency of domains for 
GG and GGG Fig. [TJ Using Eq. (128]) one can predict that ratios r n form arithmetic series, 
which can be expressed as 

r n = 7.7 +12.3- (n-2). (29) 

For balance between G and GGGG sequence we predict the ratio = 2r% — r2 = 32.3 ± 2. 
This estimate agrees with our numerical calculations for the (GGGG) + partition function 
and reaction rate ratio (see Fig. [2]). 

The perturbation theory analysis described above is approximately valid even if the energy 
of solvent fluctuations Eq. (jSJ) can not be expressed as the bilinear function of coordinates 
X. One can still define the reorganization energy A as the energy difference of adjacent G 
bases in G + G sequence induced by the environment polarization around the charge localized 
at the only one (left) base. Then all our conclusions about localization remain valid, but 
the potential barriers for charge transfer will be larger than in the case of the "harmonic" 
potential (A/4) because higher order nonlinear terms make the barrier sharper. 

Since perturbation theory works reasonably well in the parameter domain of interest, we 
will use it to study more complicated questions of the charge transfer reaction balance in 
the presence of off-diagonal interaction of solvent polarizations (cf. Eq. (flBl ) and in the 
presence of A bases surrounding G n complexes in experiments [9] . This analysis is reported 
below in Sees. IIVCI IIVDI respectively. 



C. Perturbation theory in case of off-diagonal correlations of environment polar- 
izations 

The most general Hamiltonian of environment in the bilinear approach can be expressed 



as 



1 - 

Henv = 7^ ^jXiXj, (30) 

where the interaction matrix A is defined as the inverse correlation matrix B (cf. Eq. ( TT41) ) 

A = B~ x , Bij = B\i_j\ = — ^-^ — . (31) 



Here average is defined as the thermodynamic average with the Hamiltonian Eq. ( 1301 . 

Consider the single hole problem using the perturbation theory with respect to the elec- 
tron transfer integral b. Then in the zeroth order approximation we set b = and assume 
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that the hole is localized in some G base numbered by index k. This problem can be 
described by the Hamiltonian 

1 - 

Hint = 2 AijXiXj — Xk- (32) 

The energy minimum is realized at the point where all n derivatives of energy Eq. (I3"2"j) with 
respect to X^ (i = 1, ..n) are zeros Y^j=\ AijXj = 5^. The solution to this equation is given 
by 

xP = B\i- kU (33) 

and the energy minimum is = — 

The first non-vanishing correction to the energy due to the small but finite electron 
transfer integral b appears in the second order of perturbation theory and can be expressed 
as 

i=l A i - A fc 

where the symbol is equal if i and k are non-neighboring G - sites or it is equal 1 
for neighboring G sites. Using the zero-point approximation for equilibrium coordinates Eq. 
((33]) we get (cf. Eq. (JTSJ) 



,2 



= ~ z ir-R = -4- (35) 

-DO — -Dl ^* 

where z is the number of neighboring G sites for the center of charge localization. If this 
site is in the edge of the £7+ sequence we get z = 1 and if it is in the middle we get z — 2. 
Thus energies of all representative states for sequences can be expressed within the 
second order of perturbation theory similarly to energies for the diagonal interaction (see 
Sec. HVBj) . The preexponential factors in all partition functions in the zeroth order in the 



electron transfer integral b are given by factors 1 / \J det A, which are all the same in the top 
and the bottom of ratios r n . Therefore one can still use Eq. (128]) replacing the single site 
reorganisation energy A with the charge transfer reorganization energy 

A* = B - B 1 . (36) 

This approximation is applicable in the regime of the strong localization. The approximate 
relationship between parameters of the system can be obtained using Eq. (128]) with n = 3 
where r n Pd 20. Solving this equation for the parameter in exponent we get 

b 2 

— w 0.033eV. (37) 
A* 
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This is the approximate analytical relationship between the adjacent site reorganization 
energy and electron transfer integral. 



D. Effect of adjacent A bases 

In the experiment each G n complex is always surrounded by A bases. Ionization 



potential of A base exceeds the one for the G base by approximately 0.5eV [3jj. This 

potential difference is much larger than the estimated value of the electron transfer integral 

b (see Fig. [Q) so we do not expect the crucial effect due to the coupling of G and A bases. 

The coupling strength of adjacent G and A bases has been estimated in Ref. [3]. For the 

equilibrium twisting angle 9 « 36° the overlap integral for the GA pair is approximately the 

same as for the GG pair, while for AG pair it is about two times smaller (data are given for 

5' — 3' strand). If we take this effect into account using the perturbation theory approach 

then the ratios r 2 and r% take the form 

/ (3b 2 (3b 2 \^ / (3b 2 (3b 2 
r 2 ~ exp — + — — + exp — — — + — — 

f2(3b 2 (3b 2 (3b 2 \ 

These equations can be resolved for A* and b. We obtained A* = 0.66eV and b = 0.18eV. 
Clearly this estimate corresponds to the regime of the strong localization. The expectation 
for the electron transfer integral in the absence of A bases for A* = 0.66eV is b = 0.15eV 
(see Fig. [1]) so there is a deviation between two solutions by about 20%. However we 
cannot take this estimate quite seriously because (1) Our estimate for the electron transfer 



integral b = 0.18eV exceeds the one in Ref. 18] by almost the factor of 2 (2) If we include 



the experimental error into our estimate then both parameters b and A* will not strongly 
deviate from Eq. (I37j) . while they may vary remarkably compared to the above estimate. 
Therefore the consideration of A bases seems to be the excess of accuracy. 

We believe that the effect of A bases can be clarified experimentally. In the sequence 
TG+T it will be more justified to ignore surrounding T bases then to ignore A bases in 
the sequence AG^A studied in Ref. [9j. This is because the ionization potential of T base 
exceeds that for G base by the factor of 3 more than the one for A base. Therefore the 
measurements of ratios r n for G sequences separated by T bases will clarify the effect of 
neighboring bases and possibly lead to better estimates for b and A. 
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One should notice that both the addition of A bases and the correlations of environment 
coordinates into consideration does not change our prediction that the ratios of reaction 
rates r n form the arithmetic series Eq. (I2"9j) in the case of the strong localization of charge. 

V. CONCLUSION 

We have considered the quantum state of the positive charge (hole) in poly-G - poly-C 
base sequence. We have studied a very simple but quite general model of G n sequence 
characterized by two parameters including electron transfer integral b and reorganization 
energy A. Our choice of parameters b and A was determined by the the comparison of theory 
and experiment for the ratios r n (n = 2, 3) of the charge transfer rate between single G and 
G n complexes. It turns out that the agreement with the experimental data for the ratios r n 
Eq. (jlj), Ref. 9j can be attained only assuming the strong localization of charge within a 
single G-base. The charge in DNA then behaves as a small polaron with the size less than 
the interbase distance. Charge hopping takes place due to rare environment fluctuations. 
Charge transfer rate is determined by the probability of such fluctuation and should obey 
the Arrhenius law. Based on our theory we predict that ratios of charge transfer rates form 
the arithmetic series r n = 7.7 + 12.3 • (n — 2). We propose the experiment measuring the 
ratios of rates for n > 4 as a direct verification of our theory. However we cannot uniquely 
identify specific values of the electron transfer integral b and the reorganization energy A. 
Measurements of similar ratios in TG n T sequences can be used to verify the theory and to 
make better parameter estimates. 

In addition one can attempt to determine these parameters by measuring the temperature 
dependence of the charge transfer rate through poly-G - poly-C base sequence. We expect 
that this temperature dependence can be described by the Arrhenius law with the activation 
energy defined by the difference of charge symmetric transition state energy within (GG) + 
base sequence Eq. f )27j) and the charge ground state energy for (GG) + state Eq. fl25|) 
Ea — A/4 — b + b 2 /A [32|] . Since another relationship between b and A is known from the 
ratio of charge transfer rates r 2 and r 3 (see Fig. [1]) this information will be sufficient to 
find both parameters. Note that the temperature range where such dependence can be 
measured is quite narrow (380K< T < 410K) so it will be hard to prove the Arrhenius 
law based on such experimental data. Therefore it can be only assumption that this law is 
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indeed applicable. Despite of the narrow temperature range the remarkable change in the 
charge transfer rate is expected with growing the temperature from its minimum 380K to 
its maximum 410K. For instance if A = 0.3eV this rate will increase by the factor 1.5 with 
increasing temperature, while if A = leV then the rate will increase by the factor 3.8. 

This work is supported by the NSF CRC Program, Grant No. 0628092. The authors 
acknowledge Frederick Lewis, Michael Wasielewski, George Schatz, Thorsten Fiebig, Yuri 
Berlin and Mark Ratner for useful discussions. 
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